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Abstract 

We discuss an algorithm which produces the meaning of a sentence given 
meanings of its words, and its resemblance to quantum teleportation. I fact, 
this protocol was the main source of inspiration for this algorithm which has 
many applications in the area of Natural Language Processing. 

Quantum teleportation [0 is one of the most conceptually challenging and 
practically useful concepts that has emerged from the quantum information revo- 
lution. For example, via logic-gate teleportation [fT6l it gave rise to the measurement- 
based computational model, it also plays a key role in current investigations into 
the nature of quantum correlations, e.g. Il23l . and it even has been proposed as 
a model for time travel O. It also formed the cornerstone for a new axiomatic 
approach and diagrammatic calculus for quantum theory dH |Vl [8J . 
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Arguably, when such a radically new concept emerges in a novel foundational 
area of scientific investigation, one may expect that the resulting conceptual and 
structural insights could also lead to progress in other areas, something which has 
happened on many occasions in the history of physics. In the context of quantum 
information, for example, it is well-known that quantum complexity theory has 
helped to solve many problems in classical complexity theory. 

Here we explain how a high-level description of quantum teleportation with 
emphasis on information flows has successfully helped to solve a longstanding 
open problem in the area of Natural Language Processing (NLP), and the problem 
of modeling meaning for natural language more generally [|9l Q3- This work 
featured as a cover heading in the New Scientist (11 Dec. 2011) 112211 . and has 
been experimentally tested for its capability to perform key NLP tasks such as 
word sense disambiguation in context |[T5l ^] 

The NLP problem. Dictionaries explain the meanings of words; however, in 
natural language words are organized as sentences, but we don't have dictionaries 
that explain the meanings of sentences. Still, a sentence carries more information 
than the words it is made up from; e.g. meaning(Alice sends a message to 
Bob) 7^ meaning(Bob sends a message to Alice). Evidently, this is where 
grammatical structure comes into play. Consequently, we as humans must use 
some algorithm that converts the meanings of words, via the grammatical struc- 
ture, into the meaning of a sentence. All of this may seem to be only of academic 
interest; however, search engines such as Google face exactly the same challenge. 
They typically read a string of words as a 'bag of words', ignoring the grammat- 
ical structure. This is simply because (until recently) there was no mathematical 
model for assigning meanings to sentences]^] On the other hand, there is a widely 
used model for word meaning, the vector space model [|24l . 

This vector space model of word meaning works as follows. One chooses a 
set of context words which will form the basis vectors of a vector space Given 
a word to which one wishes to assign meaning, e.g. Alice', one relies on a large 
corpus, e.g. (part of) the web, to establish the relative frequency that Alice' occurs 

'EMNLP is the leading conference on corpus-based experimental NLP. 

2 More precisely, there was no mathematical model for assigning meanings to sentences that 
went beyond truthfulness. Montague semantics [21] is a compositional model of meaning, but at 
most assigns truth values to sentences, and evidently there is more to sentence meaning than the 
mere binary assignment of either true or false. 

3 These context words may include nouns, verbs etc.; the vector space model built from the 
British National Corpus typically contains 10s of thousands of these words as basis vectors. 
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'close' to each of these basis words. The list of all these relative frequencies yields 
a vector that represents this word, its meaning vector. Now, if one wants to verify 
synonymy of two words, it suffices to compute the inner-product of the meaning 
vectors of these words, and verify how close it is to 1 . Indeed, since synonyms are 
interchangeable, one would expect them to typically occur in the context of the 
same words, and hence their meaning vectors should be the same in the statistical 
limit. For example, in a corpus mainly consisting of computer science literature, 
one would expect Alice and Bob to always occur in the same context and hence 
their meaning vectors would almost be the same. Of course, if the corpus were 
English literature (cf. [4]), then this similarity would break down. 

Until recently, the state of affairs in computational linguistics was one of two 
separate communities lfT3"1 . One community focused on non-compositional purely 
distributional methods such as the vector space model described above. The other 
community studied the compositional mathematical structure of sentences, build- 
ing on work by Chomsky 0, Lambek lfT8l and Montague 11211 . This work is 
mainly about the grammatical structure of sentences; grammatical type calculi 
are algebraic gadgets that allow one to verify whether a sentence has a correct 
grammatical structure. 

Caps, cups, and teleportation. In [1], a novel axiomatic framework was pro- 
posed to reason about quantum informatic processes, which admits a sound and 
faithful purely diagrammatic calculus 0; for some more recent developments 
we refer to JS). Ideal post-selected teleportation provides the cornerstone for the 
diagrammatic reasoning techniques, e.g. here is the derivation of the general tele- 
portation protocol where the /-label represents both the measurement outcome 
and the corresponding correction performed by Bob : 
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The main conceptual idea behind these diagrams is that, besides their operational 
physical meaning, they also admit a 'logical reading' in terms of information flow. 
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Here, the red line represents the logical flow which indicates that the state incom- 
ing at Alice's side first gets acted upon by an operation /, and then by its adjoint 
p, which in the case that / is unitary results in the outgoing state at Bob's side 
being identical to the incoming one at Alice's sidej^] 

When interpreted in Hilbert space, the key ingredients of this formalism are 
'cups' and 'caps': 

:=|00) + |11) f (00| + (11| 

and the equation that governs them is: 

(((00| + (11|) <8> Id)(Id® (| 00) + |11))) = Id 
which diagrammatic ally depicts as: 

PU = I 

4 This 'logical reading' of projectors on entangled states in terms of information flow was first 
proposed by one of the authors in 0. 
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In this language the Choi-Jamiolkowski isomorphism: 




= (ld®/)(|00) + |ll)) 



interprets a bipartite state (the grey triangle) as a 'cup' which changes the direction 
of the information flow together with an operation / that alters the information. 
Non-separatedness means topological connectedness: 




which is interpreted as the fact that information can flow between the two sys- 
tems involved. Hence, when focussing on pure states, the cups effectively witness 
entanglement in terms of the information flows that it enables. 

It is exactly this interpretation of the vectors representing the states of com- 
pound quantum systems in terms of enabling information flows that will provide 
the cornerstone for our compositional and distributional model of meaning. 

Solution to the NLP problem: the intuition. Before we explain the precise 
algorithm that produces sentence meaning from word meaning, we provide the 
analogy with the above. 

A transitive verb requires both an object and a subject to yield a grammatically 
correct sentence. Consider the sentence "Alice hates Bob". Assume that the words 
in it are represented by vectors, which as above we denote by triangles: 



Alice (g> hates <E> Boi 

Note here that treating verbs as 'compound' was already the case in grammatical 
type calculi, as we discuss below. So how do these words interact to produce the 
meaning of a sentence? For the verb to produce the meaning of the sentence, that 
is, the statement of the fact that Alice hates Bob, it of course needs to know what 
its subject and object are, that is, it requires knowing their meanings. Therefore, 
inspired by the above discussion on teleportation, we 'feed' the meaning vectors 
Alice and Boh into the verb hates which then 'spits out' the meaning of the 
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sentence: 




Again, that the meaning of the sentence is produced by the transitive verb after 
interacting with its nouns is also something that was the case in grammatical type 
calculi. 

In the same vein, for an intransitive verb, we obtain an even more direct ana- 
logue to quantum teleportation: 



<8> Id J (Alice <g) dreams) = 

Note here that non-separatedness of verbs is obvious: if in the sentence "Alice 
hates Bob" hates would be disconnected, then the meaning of the sentence would 
not depend on the meanings of "Alice" and "Bob", so, "Anyone hates everyone"! 




A grammatical type calculus: Lambek's pregroups. In order to give a precise 
description of our algorithm we now give a brief account of Lambek's pregroup 
grammar ffl9l|20llg] 

Pregroups capture structural similarities across a wide range of language fam- 
ilies [201 . They combine a remnant of group structure with partial ordering; the 
usual (left and right) group laws for the inverse are replaced by four inequalities 
involving distinct left and right pseudo-inverses x^ 1 and ~ x x: 

x^ 1 ■ X < 1 < x ■ x~ x x ■ ~ x < 1 < _1 x • X . 

As a grammatical type calculus, its elements are basic grammatical types, e.g. the 
noun type n and sentence type s. Other types arise from the pseudo-inverses and 

5 An interesting aside: Lambek published his paper on the widely used Lambek grammars in 
1958 [ 1 8 1 . His recent book on pregroups appeared 50 years later [20 1 ! All this time Lambek 
worked in Montreal, the location of the upcoming QIP, and it was also in Montreal (in 2004) that 
during a talk by one of us he first pointed to the structural coincidence between pregroup grammars 
and quantum axiomatics in terms of cups and caps. 



6 



the multiplication, e.g. the transitive verb type tv : = 1 n ■ s ■ n 1 . Then: 

... e.g. hates „ . 

e.g. Alice e.g. Boo 

'"~n~ v - ~ 1 n ■ s ■ n^ 1 ■ n = (n ■ ~ l n) ■ s ■ (n~ l ■ n) 

< 1 • s ■ 1 

= s 

Such an inequality n-tv-n < s then stands for the fact that "noun - transitive verb 
- noun" is a valid grammatical structure for a sentence. Note here the correspon- 
dence with our interpretation of such a sentence in terms of information flow: the 
verb requires two nouns to be 'fed into it' to yield a sentence type; n^ 1 and 
capture 'the verb requests type n on the left/right' . In fact, the inequalities using 
n ■ < 1 and rT x ■ n < 1 can also be represented with 'directed' caps: 

s 



n ■ 1 n ■ s ■ n 1 ■ n 
which represent the inequalities: 

1 1 

(^*~\ ^ vi v| 

n ■ ~ 1 n n" 1 ■ n 

Moreover, a pregroup can be defined in terms of cups and caps. In category theo- 
retic language, both the diagrammatic language for quantum axiomatics and pre- 
groups are so-called compact closed categories |[T7ll25l : while the quantum lan- 
guage is symmetric, pregroups have to be non-symmetric given the importance of 
word-order in sentences. 



Solution to the NLP problem: the algorithm. Assume a grammatically well- 
typed sentence and a meaning vector ~&j for each of its words, which we assume to 
be represented in a vector space of which the tensor structure matches the structure 
of its grammatical typej^je.g.: 

n^V tv = ~ l n ■ s ■ n' 1 ~» V <g> W <g> V 

where W is the vector space in which we intend to represent the meanings of 
sentences. Then one proceeds as follows: 

6 How this can be achieved within the context of the vector space model of meaning is outlined 
in D3J and used in lfT31 . 
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1 . Compute the tensor product Words — ~&\ <g) . . . ® ~&k of the word meaning 
vectors in order of appearance in the sentence; e.g. noun\ ® verb <g> noun\. 

2. Construct a linear map / that represents the type reduction as follows: given 
the diagram that represents a type reduction (cf. ([T]) above), we interpret 
caps as an d straight wire as identities; e.g. <8> Id® 

3. Compute Sentence := / '(Words) G W. 

Hence the crux is: the grammatical correctness verification procedure be- 
comes an actual linear map that transforms the meanings of words into the mean- 
ing of the sentence by making these words interact via caps. Does it work? The 
proof is in the pudding. Proof-of-concept examples are in lfT2l . and concrete ex- 
perimentally verified applications are in |fT5l . 

We invite the reader to also look at |[T2l for the example sentence: 

"Alice does not like Bob" , 



where "does" and "not" are assigned not empirical but 'logical' meanings: 









^ grammar 
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t \/\Uk^ \BobX 
meaning vectors of words 



resulting in a more interesting information flow structure. In ongoing work we 
investigate how the structures that are used to represent classical data flow in the 
quantum teleportation protocol enable one to model more of these 'logical words' . 
For example, in [10] this was done for relative pronouns such as "who", "which", 
"that" and "whose". 

Finally, while the well-established structural similarities across language fam- 
ilies in terms of grammatical type calculi may seem mysterious, the teleportation- 
like information flow interpretation presented here clearly explains them. 
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